QSAR workbench: automating QSAR modeling to drive compound design
نویسندگان
چکیده
We describe the QSAR Workbench, a system for the building and analysis of QSAR models. The system is built around the Pipeline Pilot workflow tool and provides access to a variety of model building algorithms for both continuous and categorical data. Traditionally models are built on a one by one basis and fully exploring the model space of algorithms and descriptor subsets is a time consuming basis. The QSAR Workbench provides a framework to allow for multiple models to be built over a number of modeling algorithms, descriptor combinations and data splits (training and test sets). Methods to analyze and compare models are provided, enabling the user to select the most appropriate model. The Workbench provides a consistent set of routines for data preparation and chemistry normalization that are also applied for predictions. The Workbench provides a large degree of automation with the ability to publish preconfigured model building workflows for a variety of problem domains, whilst providing experienced users full access to the underlying parameterization if required. Methods are provided to allow for publication of selected models as web services, thus providing integration with the chemistry desktop. We describe the design and implementation of the QSAR Workbench and demonstrate its utility through application to two public domain datasets.
منابع مشابه
Automating QSAR expertise
The Discovery Bus, a multi-agent software system designed for automating aspects of Molecular Design, particularly expert decision making, is described. It extends approaches aimed at automating the processing of drug discovery information but where control remains with the human expert, to automating the " tacit knowledge " of the expert and best practice, which we model as a workflow, and exp...
متن کاملMachine Learning for Drug Design
A common step in drug design is the formation of a quantitative structure-activity relationship (QSAR) to model an exploratory series of compounds. A QSAR generalizes how the structure of a compound relates to its biological activity. There is growing interest in the application of machine learning techniques in QSAR modeling research. However, no single technique can claim to be uniformly supe...
متن کاملPharmacophore and 3D-QSAR Characterization of 6-Arylquinazolin-4-amines as Cdc2-like Kinase 4 (Clk4) and Dual Specificity Tyrosine-phosphorylation-regulated Kinase 1A (Dyrk1A) Inhibitors
Cdc2-like kinase 4 (Clk4) and dual specificity tyrosine-phosphorylation-regulated kinase 1A (Dyrk1A) are protein kinases that are promising targets for treatment of diseases caused by abnormal gene splicing. 6-Arylquinazolin-4-amines have been recently identified as potent Clk4 and Dyrk1A inhibitors. In order to understand the structure-activity correlation of these analogs, we have applied lig...
متن کاملCoden : Ijpnl 6 Qsar Modeling Studies on 2 , 4 - Thiazolidinediones as Potential Α - Glucosidase Inhibitors
A linear quantitative structure-activity relationship (QSAR) model is presented for modeling and predicting the αglucosidase inhibitory activity. The model was produced by using the multiple linear regression (MLR) technique on a twenty one compound database that consists of newly discovered 2,4-thiazolidinediones. The major conclusion of this study is that molecular weight, wiener index, andre...
متن کاملExploring differential evolution for inverse QSAR analysis
Inverse quantitative structure-activity relationship (QSAR) modeling encompasses the generation of compound structures from values of descriptors corresponding to high activity predicted with a given QSAR model. Structure generation proceeds from descriptor coordinates optimized for activity prediction. Herein, we concentrate on the first phase of the inverse QSAR process and introduce a new me...
متن کامل